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We study the existence of polynomial kernels for the problem of deciding feasibil- 
ity of integer linear programs (ILPs) , and for finding good solutions for covering and 
packing ILPs. Our main results are as follows: First, we show that the ILP Fea- 
sibility problem admits no polynomial kernelization when parameterized by both 
the number of variables and the number of constraints, unless NP C coNP/poly. 
This extends to the restricted cases of bounded variable degree and bounded num- 
ber of variables per constraint, and to covering and packing ILPs. Second, we give 
a polynomial kernelization for the Cover ILP problem, asking for a solution to 
Ax > b with c T x < k, parameterized by k, when A is row-sparse; this general- 
izes a known polynomial kernelization for the special case with 0/1-variables and 
coefficients (d-HiTTiNG Set). 

1. Introduction 

This work seeks to extend the theoretical understanding of preprocessing and data reduction 
for Integer Linear Programs (ILPs). Our motivation lies in the fact that ILPs encompass many 
important problems, and that ILP solvers, especially CPLEX, are known for their preprocessing 
to simplify (and shrink) input instances before running the main solving routines (see Atamtiirk 
and Savelsbergh [2j for a survey about modern ILP solvers). When it comes to NP-hard prob- 
lems, then, formally, being able to reduce every instance of some problem would give an efficient 
algorithm for solving it entirely, and prove P = NP (cf. |13|). We avoid this issue by studying 
the question for efficient preprocessing via the notion of kernelization from parameterized com- 
plexity [7j, which relates the performance of the data reduction to one or more problem-specific 
parameters, like the number n of variables of an ILP. 

A kernelization with respect to some parameter n is an efficient algorithm that given an input 
instance returns an equivalent instance of size depending only on n; a polynomial kernelization 
guarantees size polynomial in the parameter (see Section [2] for formal definitions). This notion 
has been successfully applied to a wide range of problems (see Lokshtanov et al. [TO] for a 
recent survey). A breakthrough result by Bodlaender et al. |3] (using [11]) gave a framework 
for ruling out polynomial kernels for certain problems, assuming NP ^ coNP/poly (else the 
polynomial hierarchy collapses) Q 
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ILP feasibility. Let us first discuss the Integer Linear Program Feasibility (ILPF) 
problem: Given a set of m linear (in) equalities in n variables with integer coefficients, decide 
whether some integer point x £ Z n fulfills all of them. A well-known result of Lenstra |18j 
gives an 0(a n m c ) time algorithm for this problem, later improved, e.g., by Kannan [16] 
to O(n ( n ^m c ). We can trivially "reduce" to size N = 0{n ^) by observing that Kannan's 
algorithm solves all larger instances in polynomial time 0(Nm c ) = N°^\ Can actual reduction 
rules give smaller kernels, for example with size polynomial in n? 

It is clear that we can store an ILPF instance, for example, (A, b), with A € Z mxn , b E Z m , 
asking for x > with Ax < b, by encoding all 0{nm) coefficients, which takes 0{nm log C) 
bits where C is the largest absolute value among coefficients. Let us check what can be said 
about polynomial kernels with respect to these parameters for ILPF and the r-row-spars^l 
variant r-ILPF: 

If the row- spar seness is unrestricted, then ILPF(n + C) and ILPF(m + C) encompass Hitting 
Set(ti) and Hitting SET(mjH, respectively, which admit no polynomial kernels jSJE]. What 
about ILPF(n + m), which is the maximal open case below the trivial parameter n + m + log C 
(see Figure [[])? 

With bounded row- spar seness r, things turn out differently: For r-ILPF(n + C) and r- 
ILPF(m + log C) there are polynomial kernels: The former is not hard and we briefly explain it 
in Appendix [Dj the latter is trivial since row-sparseness r entails n < r -m and hence the above 
encoding uses ©(ram log C) = 0(rm 2 log C) = (m + log C)°^ bits. It was showed previously 
that r-ILPF(n) admits no polynomial kernel [T7], and it can be seen that the proof works also 
for r-ILPF [n + log C)0 A gain, this leaves parameter n + m open (see Figure [2]). 

Our contribution for ILPF is the following theorem which, unfortunately, settles both ILPF(n+ 
m) and r-ILPF(n + m) negatively (see Section[3j). It can be seen that this completes the picture 
regarding the existence of polynomial kernels for ILPF and r-ILPF for parameterization by any 
subset of n, m, log C, and C. The same is true for the column-sparse case g-ILPF (see Figure [3]), 
but we omit a detailed discussion since it is quite similar to the row-sparse case. 

Theorem 1. ILPF(n + m) does not admit a polynomial kernelization or compression, unless 
NP C coNP/poly. This holds also if each constraint has at most three variables and each 
variable is in at most three constraints. 

It appears that ILPF(n + m) is the first problem for which we know that a polynomial 
kernelization fails solely due to the encoding size of large numbers in the input data (and taking 
into account our proof that no reduction is possible); an additional parameter logC would 
trivially give a polynomial kernel. This of course fits into the picture of hardness results for 
weight(ed) problems, e.g., W[l]-hardness of Small Subset Sum(/c) where the task is to pick 
a subset of at most k numbers to match some target sum [8] and the kernelization lower bound 
for Small Subset Sum(/c + d) where the value of numbers is upper bound by 2 d [6]; however, 
in both cases the number of weights is not bounded in the parameters. Furthermore, there 
are lower bounds for weighted graph problems (e.g., [15]), but there the used weights have 
value polynomial in the instance size and hence negligible encoding size. We also point out two 



2 Row-sparseness r: at most r variables per constraint; column-sparseness q: each variable occurs in at most q 

constraints; we use r and q throughout this work. 
3 Hitting Set: Given a base set U of size n, a set J of m subsets of U, and an integer k, find a set of k 

elements of U that intersects each set in T (if possible). ILP formulation: Is there {x) u &j with "}2 u£U x u < k, 

and J2ucf x « - 1 f° r au F e Tl 
4 The used cross-composition with t input instances creates coefficients of value 0(t 2 ) with encoding size log C = 
O(logt) which is permissible for a cross-composition. 
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contrasting positive results: A randomized polynomial compression of Subset SuM(n) [13] . 
and a randomized reduction of KNAPSACK(n) to many instances of size polynomial in n |21j 
(the number of instances depends on the bit size of the largest weight). 

Covering and packing ILPs. Given the overwhelming amount of negative results for ILPF, 
we turn to the more restricted cases of covering and packing ILPs (cf. [22]) with the hope of 
identifying some positive cases: 

(covering ILP:) min c T x (packing ILP:) max c T x 

s.t. Ax >b s.t. Ax <b 

x > x > 

Here A, b, and c have non- negative integer entries (coefficients). Feasibility for these ILPs is 
usually trivial (e.g., x = is feasible for packing), and the more interesting question is whether 
there exist feasible solutions x with small (resp. large) value of c T x. Encompassing many well- 
studied problems from parameterized complexity, we ask whether c T x < k (resp. c T x > k), 
and parameterize by k; instances are given as (A, b,c, k). Unsurprisingly, there are a couple of 
special cases contained in this setting that have been studied before (e.g., with 0/ 1-variables and 
coefficients); some of those are W[l]-hard (and are hence unlikely to have polynomial kernels), 
whereas others have positive results that we could hope to generalize to the more general ILP 
setting. To capture the different cases, we use the column-sparseness q and row-sparseness r of 
the matrix A: taking q and r as constants, additional parameters, or unrestricted values defines 
different problems. Our main result in this part is a polynomial kernelization for r-CovER 
ILP(fc) (see Section[3|); the special case of only 0/1 variables and coefficients is known as r- 
Hitting Set(/c) and admits kernels of size 0(k r ) [10[ H]. Our result also uses the Sunflower 
Lemma (like [10J), but the reduction arguments for sunflowers of linear constraints are of course 
more involved than for sets. 

Theorem 2. The r-CoVER IhP(k) problem admits a reduction to 0(k r2+r ) variables and 
constraints, and a polynomial kernelization. 

Furthermore, we show how to preprocess instances of Packing ILP (A; + q + r) and Cover 
ILP(/s + q + r) to equivalent instances with polynomial in kqr many variables and constraints 
(see Section [5]). For t-Packing ILP (A; + q) this is extended to a polynomial kernelization, 
generalizing that for the special case of bounded degree Independent Set(/c). To put these 
results into context, we provide an overview containing also the inherited hard cases in Tables Q] 
and [21 for completeness, a brief discussion of these cases can be found in Appendix O 

2. Preliminaries 

A parameterized problem over some finite alphabet E is a language PCS'xN. The problem V 
is fixed-parameter tractable if (x, k) € V can be decided in time f(k) ■ (\x\ + k)°( l \ where / is 
an arbitrary computable function. A kernelization for V is a polynomial-time algorithm that, 
given input (x,k), computes an equivalent instance (x',k') with |x'|,A/ < h(k) where h is some 
computable function; K is a polynomial kernelization if h(k) is polynomially bounded in k. By 
relaxing the restriction that the created instance (x' , k') must be of the same problem and allow 
the output to be an instance of any language (i.e., any decision problem) we get the notion of 
(polynomial) compression. Almost all lower bounds for kernelization apply also for this weaker 
notion. 
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Parameterized complexity of Packing ILP(fc) 
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Table 1: "PK" stands for polynomial kernel, "no PK" stands for no polynomial kernel unless 
NP C coNP/poly. All normal- font entries are implied by boldface entries. 



Parameterized complexity of Cover ILP(fc) 
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Table 2: "PK" stands for polynomial kernel, "no PK" stands for no polynomial kernel unless 
NP C coNP/poly. All normal- font entries are implied by boldface entries. 



We also use the concept of an (OR-)cross-composition of Bodlaender et al. [I] which builds on 
the breakthrough results of Bodlaender et al. [3J and Fortnow and Santhanam [11] for proving 
lower bounds for kernelization. 

Definition 1 ([!]). An equivalence relation fa on S* is called a polynomial equivalence relation 
if the following two conditions hold: 

1. There is a polynomial-time algorithm that decides whether two strings belong to the same 
equivalence class (time polynomial in |x| + \y\ for x,y € £*). 

2. For any finite set S C S* the equivalence relation 1Z partitions the elements of S into a 
number of classes that is polynomially bounded in the size of the largest element of S. 

Definition 2 ([!]). Let L C X* be a language, let 1Z be a polynomial equivalence relation 
on X*, and let V C X* x N be a parameterized problem. An OR- cross- composition of L into V 
(with respect to 1Z) is an algorithm that, given t instances x±, X2, ■ ■ ■ ,xt G X* of L that are fa- 
equivalent, takes time polynomial in Yl\=i \ x i\ an d outputs an instance (y,k) £ X* x N such 
that: 

1. The parameter value k is polynomially bounded in maxj \xi\ + log£. 
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2. The instance (y, k) is yes if and only if at least one instance Xi is yes. 
We then say that L OR-cross-composes into V . 

Theorem 3 ([!]). If an NP-hard language L OR-cross-composes into the parameterized prob- 
lem V , then V does not admit a polynomial kernelization or compression unless NP C coNP/poly 
and the polynomial hierarchy collapses. 

3. A kernel lower bound for I LPs with few coefficients 

In this section, we prove Theorem [IJ i.e., that Integer Linear Program Feasibility (n + m) 
admits no polynomial kernelization unless NP C coNP/poly. We begin with a technical lemma 
that expresses multiplication by powers of two in an ILP. The crucial point is that we need 
multiplication by t different powers of two, but can afford only 0(log c t) variables and coefficients 
(direct products of variables are not legal in linear constraints). 

Lemma 1. Let a, b, and p be variables, and let 6 max ,p max > be integers. Let I = |~logp max ~|. 
There is a system ofQt + 7 linear constraints with 21 — 1 auxiliary variables such that all integer 
solutions have < b < 6 max , < p < p m &x, and a = b ■ 2 P . Conversely, if these three conditions 
hold then feasible values for the auxiliary variables exist. The system uses coefficients with bit 
size 0(p max log 6 max ) and all variables have range at most {0, . . . ,6 max • 2 Pmax }. 

Proof. If 6 max = then constraints a = 0,6 = 0,0<p, and p < p max suffice to prove the 
lemma; henceforth 6 max > 1. Define a constant M = 6 max • 2 Pmax + 1; this will be the coefficient 
with the largest absolute value (bit size as claimed) and also exceed the value of all involved 
variables. To enforce this and the claimed bounds on b and p we add the following constraints. 

b > 0, p > 0, a > 0, 

b < 6 max , v < Pmax, a < 6 max • 2 Pmax (< M) 

Clearly, wg now have <c AI. Let £ — |~lo§Pmax 

] and add variables po, - ■ ■ ,Pe-i with 
range {0, 1} (this incurs 2£ constraints) together with a constraint 

l-i 

P = ^2 l Pu (1) 

which enforces that the pi form a binary encoding of p. Now, we can rewrite a = b ■ 2 P as follows 
(note that this is not added directly as a constraint). 

i-i 

a = b-2P = b- 2^"=o 2lft = b Yl 2 2 ^ (< M) 

Our strategy is to enforce the partial products aj = ^0^=0 ^ 2lpi < M, for all j £ {0, . . . ,£ — 1}, 
by enforcing aj = 2 23pj aj-i; for notational convenience we identify ae~i := a and a_i := b 
(the remaining ao, • • • ,a^-2 are new auxiliary variables). We add the following constraints for 
allj e {0,...,£-l}: 

dj > Oj— l (2) 
aj < 2 2J cij-i (3) 
dj + M- Pj M > 2 2 ° aj-i (4) 
o.j < Oj-i +PjM (5) 
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It suffices to show that a,j = 2 23p ^aj-i for all j € {0, . . . ,£— 1}. First, observe that constraints ^ 
and ([3j) restrict aj to range {aj_i, . . . , 2 2J aj-\}. Second, consider the effect of pj = respec- 
tively pj = 1: If pj = then constraint (|3]) is trivially fulfilled since M > 6 max • 2 Pmax > 2 2 ~ 'aj_i, 
whereas constraint ([5|) enforces that aj < aj-i', together we get aj = Oj_i = 2 7Pp iaj-\ as 
needed. If instead pj = 1 then constraint §5§ is trivially fulfilled whereas constraint (dJ en- 
forces aj > 2 2J aj_i; together we get dj = 2 2J Oj-i = 2 2 ^cy-i as needed. 

Thus, our set of constraints correctly enforces the intended partial products aj which implies 
that a = a^_i has to take the desired value b ■ 2 P , as claimed. 

For the converse, given the above discussion it is easy to check that setting po, . . . ,pt-i to 
the binary expansion of p and setting ao, • • • ,ae-2 to the values of the corresponding partial 
products gives a feasible assignment. □ 

Now we are set up to prove the first part of Theorem [TJ 

Lemma 2. Integer Linear Program Feasibility (n + m) admits no polynomial kerneliza- 
tion or compression unless NP C coNP/poly. 

Proof. We give an OR-cross-composition from the NP-hard Independent Set problem. The 
input instances are of the form (G = (V,E),k) where G is a graph and k < \V\ is an integer, 
asking whether G contains an independent set of size at least k. For the polynomial equivalence 
relation 1Z we let two instances be equivalent if they have the same number of vertices and 
the same solution size k. It is easy to check that this fulfills Definition [TJ For convenience we 
consider all input graphs to be on vertex set V = {1, . . . , n}, for some integer n. 

Let t ^-equivalent instances (Go = (V, Eq), k), . . . , (Gt-i = (V, E t -i), k) be given. Without 
loss of generality we assume that t = 2^ for some integer £ since otherwise we could copy one 
instance sufficiently often (at most doubling the input size and not affecting whether at least 
one of the given instances is yes). 

Construction. We now describe an instance of Integer Linear Program Feasibility 
that is yes if and only if at least one of the instances (Go = (V, Eq), k), . . . , (Gj_i = (V, £t-i), k) 
is yes for Independent Set. We first define an encoding of the t edge sets E p into Q) 
integer constants D(i,j), one for each possible edge {i,j} € (Y) (throughout the proof we 
use 1 < i < j < n): 

t-l 

D ihj) '■= ^2 2P ■ D {iJ,P), where D(i,j,p) := 

p=0 

In other words, the p-th bit of D(i,j) is one if and only if {i,j} is an edge of G p = (V, E p ). The 
values < D(i,j) < 2* — 1 will be used as constants in the ILP that we construct next. 

1. We start with a single variable p that is intended for choosing an instance number; its 
range is {0, . . . , t — 1}. The rest of the ILP will be made in such a way that it ensures that 
a feasible solution for the ILP will imply that instance (G p , k) is yes for Independent 
Set, and vice versa. 

2. Now we will add constraints that allow us to extract the necessary information regarding 
which of the possible edges {i, j} are present in graph G p . Recall that the p-th bit of the 
constant D(i,j) encodes this. For convenience, we will derive the needed constraints and 
argue their correctness right away. For each possible edge {i,j} with 1 < i < j < n we 
introduce a variable ejj with the goal of enforcing e^j = 1 if {i,j} G E p , and e^j = else. 



1 if {i,j}eE p , 
else. 
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Let i,j with 1 < i < j < n be fixed (we apply the following for all these choices). Clearly, 

p-i t-i 

s=0 s=p+l 

We are of course interested in the 2 p D(i,j,p) term, which takes value either or 2 P . Since 
the first term X^=o 2 s D(i,j, s) is at most 2 P - 1 and the last is a multiple of 2 P+1 , we will 
extract it via a constraint 

D(i,j) = a + f3 + 7 , (6) 

assuming that we can enforce the following conditions: (i) < a < 2 P — 1, (ii) j3 £ {0, 2 P }, 
and (hi) 7 € {0, 2 P+1 , 2 • 2 P+1 , . . .}. (Note that we use new variables otij,f3i t j, r yi ! j,6i ) j,£i t j 
for each choice of i and j, but in the construction the indices are omitted for readability.) 
This is where Lemma[T]comes into the picture as it permits us to enforce the creation of the 
required values and range restrictions without using overly many variables and constraints. 
For (i) we add a variable 5 and enforce S = 2 P by using Lemma [T] on a = 5, b = 6 max = 1, 
and p with p max = t — 1. We then add constraints a > and a < 5 — 1. For (ii) 
we apply the lemma on a = (3, b = e^j with 6 max = 1, and p with p max = t — 1, 
enforcing j3 = 2 p ei,j. (Note that we want to get e,j = D(i,j,p) from (3 = 2 P ■ D(i,j,p) 
anyway; this way it is already enforced.) For (iii) we add a new variable e > and 
apply the lemma on a = 7, b = 2e with 6 max = 2* — 1 (formally, this requires a new 
variable b 1 and constraint b' = 2e), and p with p max = t - 1, enforcing 7 = 2 p 2e = 2 p+1 e. 
The fact that there are no further restrictions on e effectively allows 7 to take on any 
multiple of 2 P+1 (and no other values). The upper bound 6 max on b = e for Lemma [1] 
comes from D(i,j) < 2* — 1 (we do not require larger values of 7 and e since 7 < D(i,j) 
and 7 = 2 p e). 

The most costly application of Lemma [1] is for (iii) where we have 6 max = 2* — 1 and p max = 
t — 1. This incurs C(logp max ) = O(logt) additional variables and constraints, and uses 
coefficient bit size 0(p max log 6 max ) = 0(t 2 ) (same bounds suffice also for (i) and (ii)). 
Thus, over all choices of 1 < i < j < n, i.e., for getting all the needed edge information, we 
use 0(^2) logi) additional variables and constraints. For each variable e^j our constraints 
ensure that it is equal one if {i,j} is an edge in G p ; else it has value zero. 

3. Now we can finally add the actual edge constraints needed to express the Independent 
Set problem. We add n variables x±, . . . ,x n with range {0, 1}, one variable X{ for each 
vertex i € V. For each possible edge £ ( 2 ), i.e., for all 1 < i < j < n we add the 

following constraint. 

Xi + Xj + e it j < 2 (7) 

Finally, we add a constraint 

n 

Y^i>k (8) 

i=\ 

to ensure that we select at least k vertices. This completes the construction. 

Let us now check the number of variables and constraints in our created ILP. The number of 
variables is dominated by the 0((^j logt) variables added in Step [21 which come from 0{(^)) 
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applications of Lemma [TJ The same is true for the number of constraints used. Thus both pa- 
rameters of our target instance are polynomially bounded in the largest input instance plus log t. 
Since we postulated no further restrictions on the target ILP (e.g., A and b may have negative 
coefficients), we will omit a discussion of how to write all constraints as Ax < b with x > 
(here x is the vector of all variables used) since that is straightforward. The largest bit size 
of a coefficient is 0(t 2 ) hence it is easy to see that the whole ILP can be generated in time 
polynomial in the total input size (which is roughly 0{t ■ n 2 ) from the t Independent Set 
instances with n vertices each). It remains to argue correctness of the construction. 

Soundness. Assume that the created ILP has at least one feasible solution and pick according 
values for all variables. The goal is to show that instance number p is yes for Independent 
Set. To this end, let S C V denote the set of vertices i for which X{ = 1. By ([8]) we know 
that 15*1 > k. Now, for any two vertices i,j G S with i < j it suffices to show that {i,j} is not 
an edge of the graph G p . By ([7]) and i, j E S we know that e^j = since x,- t + Xj + ey < 2. As 
we already showed in the construction that ej j must equal the p-th bit of D(i,j) this implies 
that {i, j} ^ E p , as needed. 

Completeness. Now, assume that for some p* € {0, . . . ,t — 1} the instance (G p * = 
(V,E p *),k) is yes for Independent Set and let S C V be some independent set of size k 
in G p *. Set p = p* and assign the following values to the other variables to get a feasible 
solution for the ILP: 

• Set the variables Xi to one for all i € S and to zero for i 6 V \ S. Clearly this fulfills @. 

• Set each edge variable aj to one if and only if {i, j} is an edge of G p (and zero else). It is 
easy to see that together with the x-variables this fulfills all constraints ([7D since we only 
have Xi = Xj = 1 if {i,j} is not an edge in G p (by choice of S). 

• For each choice of 1 < i < j < n, set 8ij = 2 P and set 

p-i 

s=0 

t-i 

7iJ = 2 s £>(U,s). 

s=p+l 

Thus < Qjj- < 2 P — 1 = 5ij — 1 and /3 = 2 p eij- Also, clearly, our choices for a, /3, and 7 
fulfill ©. 

As jij is a multiple of 2 P+1 we can set Eij — 2pVi to agree with 7 jj — 2 p ~^^Eij that we 
enforced via Lemma [TJ Similarly, Sij = 2 P and = 2 p ejj agree with what we enforced 
by the lemma. Thus, for all three applications of the lemma (per possible edge {i,j}) it 
is guaranteed (by the lemma statement) that feasible values for all auxiliary variables can 
be found. 

It follows that the constructed ILP instance has a feasible solution, as required. This completes 
the correctness part. 

Thus we have an OR-cross-composition from the NP-hard Independent Set problem to the 
Integer Linear Program Feasibility^ + m) problem. By Theorem El this implies that 
Integer Linear Program Feasibility (n + m) has no polynomial kernel or compression 
unless NP C coNP/poly. □ 
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The second part of Theorem [T] is now an easy corollary. Similarly, we get lower bounds for 
covering and packing ILP with parameter n + m; the proofs are postponed to Appendix IA.1I 
and IA.21 Getting the sparseness for Corollary [T] is easier than what was needed for r-ILPF(n) 
in [IT] since both number of constraints and number of variables are bounded in the parameter 
value. 

Corollary 1. Integer Linear Program Feasibility + m) restricted to instances that 
have at most 3 variables per constraint (row- sparseness) and with each variable occurring in 
at most 3 constraints (column- sparseness) admits no polynomial kernel or compression unless 
NP C coNP/poly. 

Corollary 2. Cover ILP (n + m) and Packing ILP (n + m) do not admit polynomial kernel- 
izations or compressions unless NP C coNP/poly. 

4. Polynomial kernelization for row-sparse Cover ILP(fc) 

In this section, we prove Theorem [2j by giving a polynomial kernelization for r-CovER ILP(/c), 
generalizing polynomial kernelizations for t-Hitting Set(&?) [1Q|. IT]. Our result uses the sun- 
flower lemma (stated below) that can also be used for t-Hitting Set (as in [10]). However, the 
application is complicated by the fact that the replacement rules for a sunflower of constraints 
are not as clear-cut as for sets in the hitting set case: Constraints forming a sunflower pairwise 
overlap on the same variables but with (in general) different coefficients; hence no small replace- 
ment is implied. Additionally, we have to bound the number of constraints that have exactly 
the same set of variables with nonzero coefficients, called scope, since the sunflower lemma will 
only be applied to the set of different scopes. The main work lies in the proof of the following 
lemma; the polynomial kernelization is given as a corollary. 

Lemma 3. The r-CovER ILP (k) problem admits a polynomial-time preprocessing to an equiv- 
alent instance with 0(k r +r ) constraints and variables. 

Before we turn to the proof, we give a lemma that captures some initial reduction arguments 
including a bound on the number of constraints with the same scope. 

Lemma 4. Given an instance (A,b,c,k) of r -Cover ILP (7c J we can in polynomial time reduce 
to an equivalent instance (A',b',c',k) such that: 

1. No constraint is satisfied if all variables in its scope are zero, i.e., b\ > 1. 

2. The cost function c' T is restricted to 1 < < k. 

3. All feasible solutions with c' T x < k have Xi € {0, . . . , k} for all i. 

4- There are at most (k + l) d constraints for any scope of at most d variables. 

Proof. Recall that we are considering covering constraints of the form A[i, -]x > b[i]. Constraints 
that are satisfied when all their variables are zero, are satisfied by all x > and may be discarded 
(as we already require x > 0). If q = for some variable Xi then we can set Xi to arbitrarily 
large values and thus satisfy all constraints containing it (here we use monotonicity of covering 
constraints); we may thus safely delete any such variable and all constraints containing it. 
Similarly, if q > k for some Xi then already when setting Xi > 1 we cannot get cost c T x at most k\ 
thus we delete such variables (but not their constraints). Consequently, we get 1 < q < k for 
all remaining variables X{. This also limits the possible values with c T x < k to X{ € {0, ... , A;}. 
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Finally, consider any set of d variables with more than [k + l) d constraints having exactly 
this scope. It is clear that there are only (k + l) d possible assignments to the variables that do 
not violate the maximum cost of k. Each constraint can rule out some of those. It is therefore 
sufficient to keep only one constraint for each infeasible assignment, as all further constraints 
are redundant and may be deleted, giving the claimed bound. Note that each constraint has at 
most r variables in its scope, hence we can perform this reduction in time polynomial in n. □ 

We recall sunflowers and the sunflower lemma of Erdds and Rado [9] . 

Definition 3 ([2]). Let T denote a family of sets. A sunflower in T of cardinality t and with 
core C is a collection of t sets {Fi, . . . ,Ft} C T such that F^ n Fj = C for all i ^ j. The 
sets Fi \C, . . . ,Ft\C are called the petals of the sunflower; they are pairwise disjoint. The 
core C may be empty. 

Lemma 5 (Sunflower Lemma JS]]). Let T denote a family of sets each of size d. If the cardinality 
of T is greater than dl-(t—l) d then T contains a sunflower of cardinality t, and such a sunflower 
can be found in polynomial time. 

Proof of Lemma\^ To begin with, we apply Lemma E] in polynomial time. Afterwards, for 
each constraint scope with d < r variables, there are at most (k + l) d = 0(k r ) constraints 
with that scope. Now, we will apply the sunflower lemma to the set of all scopes of size d, for 
each d G {1, . . . , r}. If there are more than d\ ■ (t — l) d = 0(t r ) sets then we find a sunflower 
consisting of t sets (scopes). We will show how to remove at least one constraint matching one 
of the scopes, when t is sufficiently large (and polynomially bounded in k). 

If, instead, the number of scopes is at most d\ ■ (t — l) d (for all d) then we can bound the total 
number of constraints as follows: We have r choices for d G {1, . . . , r}. For each d we have at 
most d\ ■ (t — l) d = 0(t r ) constraint scopes. For each scope there are at most 0(k r ) constraints. 
In total this gives a bound of r • 0{t r ) ■ 0{k r ) = 0(t r ■ k r ). Since each constraint has at most r 
variables we get the same bound (in O-notation) for the number of variables. 

Removing a constraint. Let us now see how to find a redundant constraint when there 
are more than d\ ■ (t — l) d constraint scopes for some 1 < d < r. We will also derive an 
appropriate value for t (at least t > k). Consider a f-sunflower in the set of constraint 
scopes of size d (which we can get in polynomial time from the sunflower lemma). Let its 
core be denoted by C = {x\, . . . , x s }, with < s < d < r, and its pairwise disjoint petals 
by {yi^+i, . . . , yi,d}, • • • > {yt,s+i, ■ ■ ■ , Ut,d}- (Note that s < d is needed since all sets are differ- 
ent, which requires nonempty petals; else they would all equal the core.) Thus there must be 
constraints in the ILP matching these scopes. We arbitrarily pick one constraint for each scope: 

«i,i^i + (11,2X2 + • • • + ai yS x s + ai, s+ iyi jS+ i + . . . + a\^Vi,d > h 
a 2 ,ixi + a 2 ,2X2 + • • • + a 2 , s x s + a 2 , s +iy2,s+i + • • • + a 2t dV2,d > b 2 

at,\X\ + a t)2 x 2 + . . . + a t)S x s + a t , s+ iyt, s +i + ■■■ + at,d,yt,d > h 

Note that to keep notation simple the indexing of the variables and coefficients is only with 
respect to the sunflower and makes no assumption about the actual numbering within Ax > b; 
all arguments are local to the sunflower. 

First, let us note that we may return no (or a dummy no- instance) if the core is empty 
and t > k: That would give us more than k constraints on disjoint sets of variables that each 
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require at least one nonzero variable; this is impossible at maximum cost k. In the following, 
assume that s > 1. 

Since each variable takes values from {0, 1, . . . ,k} (by Lemma HD, there are at most {k + l) s 
possible assignments for the s core variables (in fact there are even less since the sum is at 
most k). It is clear that assigning zero to all core variables does not lead to a feasible solution 
since each of the t constraints requires at least one nonzero variable and the petal variables 
are disjoint. However, unlike for t-Hitting Set, assigning one to a single core variable is not 
necessarily sufficient to satisfy all constraints, and the value of each variable for a constraint 
might be quite different. Thus, we may not simply replace the constraints of the sunflower by 
the restriction of one constraint to the core variables. 

To cope with this difficulty we employ a marking strategy. We check all (k + l) s assignments 
of choosing a value from {0, l,...,k} for each core variable. It is possible that some of the 
constraints are already satisfied by this core assignment due to the monotone behavior of cov- 
ering constraints. If more than k constraints need an additional nonzero variable (which would 
be a petal variable) then clearly this core assignment is infeasible. In this case we arbitrarily 
mark k + 1 constraints that are not satisfied by the core assignment alone (i.e., if all their other 
variables would be zero); these serve to prevent the core assignment from being chosen. If at 
most k constraints are not yet satisfied then we mark all of them. Clearly, in total we mark at 
most (k + 1) • {k + l) s constraints. 

We will now argue that all unmarked constraints can be deleted. Clearly, deletion can only 
create false positives, so consider a solution to the instance obtained by deleting any unmarked 
constraint. If the assignment does not also satisfy the removed constraint (where we take value 
zero for variables that do not occur after deletion) then, in particular, the same is true for the 
core assignment made by this assignment. Hence, while marking constraints with respect to 
this core assignment we must have marked k + 1 other constraints (that are also not satisfied 
by the core assignment alone), which are hence not deleted. However, these other constraints 
cannot all be satisfied with a budget of at most k, a contradiction. Thus, deleting all unmarked 
constraints is safe. 

Therefore, if we have a sunflower with core size s and more than (k + 1) • (k + l) s constraints 
then our marking procedure allows us to delete at least one constraint. Thus, allowing for core 
size s up to r — 1, we set t = {k + 1) • + l) r_1 + 1 = 0(k r ). While we can find t-sunflowers (via 
the sunflower lemma) we can always delete at least one constraint. Our earlier discussion at 
the start of the proof now gives a bound of 0(t r ■ k r ) = 0(k r +r ) on the number of constraints 
and variables achieved by this reduction process. This completes the proof. □ 

Corollary 3. The r-CovER ILP (k ) problem admits a polynomial compression to size 0{k r +2r ) 
and a polynomial kernelization. 

Proof (sketch). We know how to reduce to 0(k r2+r ) constraints in polynomial time. Since each 
constraint can equivalently be described by the infeasible assignments that it defines for the 
variables in its scope, we may encode the instance by replacing each constraint on d < r variables 
by a 0/1-table of dimension (k + l) d : Each entry means that the assignment corresponding to 
this coordinate is infeasible (e.g., entry at (2,3,5) tells whether x\ = 2, xi = 3, and x% = 5 is 
feasible for this constraint, when (xi,X2,X3) is its scope); each 1 stands for a feasible entry. For 
each constraint this requires (k + l) d = 0(k r ) bits for the table plus r ■\og{k r +r ) = C(log k) bits 
for encoding the scope (there are at most r variables, each one encoded by a number from 1 
to 0(k r +r )). The cost function c T x requires only the storing of 0(k r +r ) integers with values 
from to k, taking O(logfc) bits each. In total this gives the claimed size for the compression. 
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To get a polynomial kernelization let us first observe that the problem described above is 
clearly in NP since feasible solutions (certificates) can be verified in polynomial time. Thus, by 
an argument of Bodlaender et al. [5], we get an encoding as an instance of t-Cover ILP(fc) by 
using the implicit Karp reduction whose existence follows from the fact that r-CoVER ILP(fc) 
is complete for NP. The size of this instance is polynomial in 0(k r +2r ), and hence polynomial 
in k. This completes the polynomial kernelization. □ 

5. Further results 

In this section we state some results for Packing ILP(/c) and Cover ILP(&;) with column- 
sparseness q and row-sparseness r as additional parameters or constants; the proofs are post- 
poned to App endix IB . 1 1 and IB . 2 1 

Theorem 4. Packing ILP (k + q + r) is fixed-parameter tractable and admits a polynomial- 
time preprocessing to O(kqr) variables and 0(kq 2 r) constraints. For r-PACKlNG ILP (7c + q) 
this gives a polynomial kernelization. 

Theorem 5. Cover ILP (k + q + r) is fixed-parameter tractable and admits a polynomial-time 
preprocessing to O(kqr) variables and 0(kq) constraints. (A polynomial kernelization for r- 
Cover ILP (7c + q) follows from Corollary^) 

6. Conclusion 

We have studied different problem settings on integer linear programs. For Integer Linear 
Program Feasibility parameterized by the numbers n of variables and m of constraints we 
ruled out polynomial kernelizations, assuming NP ^ coNP/poly. This still holds when both 
column- and row-sparseness are at most three. Adding further new and old results, this settles 
the existence of polynomial kernels for ILPF and r-ILPF for parameterization by any subset 
of {n, m, log C, C} where C is the maximum absolute value of coefficients. 

Regarding covering and packing ILPs, we gave polynomial kernels for t-Cover ILP(/c), gen- 
eralizing r-HiTTiNG Set(/c), and for t-Packing ILP(fc + q). Further results and observations 
give an almost complete picture regarding q and r except for the question about polynomial ker- 
nels for parameter k + q+r. We recall that for both problems one can reduce to n, m < (kqr)°^ 
but parameterization by n and m only admits no polynomial kernelization (Corollary [2]). 
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A. Omitted proofs from Section [3] 
A.l. Proof of Corollary Q] 

Proof (sketch). It suffices to show how an instance of Integer Linear Program Feasi- 
bility^ + m) can be modified to fulfill the extra spareness restrictions, while not blowing 
up the numbers of variants and constraints to more than (n + m)°^ (formally this consti- 
tutes a polynomial parameter transformation from the unrestricted to the sparse case). We 
make no particular assumption about the constraints other than demanding that they are ex- 
pressible as Y^i=\ a i x, i ^ where the X{ are the variables, a^s and b are integer coefficients, 
and ME {<,>,=}• By replacing the sums with partial sums Sj = Yli=i a i x i computed via 
constraints Sj = Sj-i + ctjXj (or, formally, Sj — Sj-± — otjXj = 0), we can reduce the number 
of variables per constraint to three. Per original constraint this requires 0(n) new variables 
for partial sums as well as 0(n) constraints, for a total of 0{nm) additional variables and con- 
straints. To reduce also the number of occurrences of each variable, it suffices to make copies 
of the variables. Conveniently, the variables for the partial sums occur only two times each 
(intuitively, once for forcing their own value and once for enforcing the next value, resp., for 
the occurrence of s n in s n < b). Each original variable occurs at most once in each of the m 
original constraints. When moving to the partial sums, this is replaced by at most one occur- 
rence for computing the partial sum needed for a given constraint; thus it suffices to have m 
copies a?i(l), . . . ,Xi(m) to use in the partial sum constraints. Clearly, this can be achieved via 

Xi = Xi(l) Xi(l) = Xi(2) ... Xi(m - 1) = Xi(m). 

This gives two occurrences per variable Xi{j) with the third being in place of X{ in a partial sum 
constraint. We use an additional 0(nm) variables and 0(nm) further constraints for this part. 

It is easy to see that we do not affect feasibility of the ILP since we only make equivalent 
replacements, respectively introduce copies of variables. In total we use 0(nm) additional 
variables and constraints, meaning that the final row- and column-sparse ILP has polynomial 
in n + m many variables and constraints as needed. If we would have a polynomial kernel or 
compression for this problem, then combined with the present reduction this would constitute 
a polynomial compression for the Integer Linear Program Feasibility (ra + m) problem 
that we started with, implying NP C coNP/poly. This completes the proof. □ 

A.2. Proof of Corollary [2] 

Proof. We prove the corollary for the case of Cover ILP (n+m); the case of packing ILPs works 
along the same lines. To this end, we start from the instance created in the proof of Lemma [2] 
and turn it into a covering ILP plus a specified bound on the maximum target function value. 
The main properties that we require is that each variable in the created ILP (for which we ask 
for feasibility) has an upper bound on its range, e.g., p < t — 1 and fyj < 2*, and that all 
variables are non-negative. For convenience let us write B{z) for the upper bound on the range 
of any variable z (irrespective of the actual names used in the proof of Lemma [2]). 

The instance created in the proof of Lemma asks whether a given set of linear constraints 
admits a feasible point with integer coordinates. For all involved variables z we have < 
z < B(z) for some a priori known bound B(z). Let n denote the number of variables, and 
let z\, . . . ,z n be an enumeration of the variables. Each constraint can be written as 

n 

^ctiZi < b, 
i=i 
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where ME {<, >, =} and where b and each on are integer constants. Obviously, each equality 
constraint can be split into one < and one > constraint (with the same coefficients). Subse- 
quently, we may turn each < constraint into a > constraint be multiplying all its coefficients 
by -1. 

It remains to handle the fact that we may have negative coefficients. This is where the bounds 
on the variables come in. For each variable zi with upper bound B{zi) we add a constraint 

Zi + zt> B(zi), (9) 

where Zi is a new variable, constrained to non-negative values by z% > 0. To ensure that we 
actually get Zi + z^ = B(zi), we let our target function be 

n n 

min ^Zi + ^Zi- (10) 
i=l i=l 

By asking for a feasible solution of cost (target function value) at most Yl7=l B(zi) we ensure 
that all constraints (|9j) must be fulfilled with equality (these constraints enforce a lower bound 
of Yl?i=i-B{zi) on the cost). Thus, for all i, we have z\ + i{ = B{zi). This permits us to 
replace all occurrences of z^ with negative coefficient, say —7 where 7 > 1, by plugging in z^ = 
B(zi) — zi. We get —7^ = —^fB(zi) + 7^; moving —^B{z,j) into the RHS of the inequality 
(i.e., adding ^B(zi) to both sides) we have thus reduced the number of negative coefficients by 
one. Afterwards, should there be a negative RHS coefficient, then we may return a dummy no 
instance. (Note however that we only make equivalent transformations so if there was a feasible 
point for the initial ILP then this translates naturally to a feasible solution for the obtained 
covering ILP.) 

When all this is done, we have only > constraints with non-negative coefficients, and all 
variables are restricted to non- negative values. Thus we have obtained an equivalent covering 
ILP with 2n variables and at most n + 2m constraints, which transfers the kernelization lower 
bound from Lemma [2] also to Cover ILP(n + m). The proof for Packing ILP(n + m) is 
analogous. □ 

B. Omitted Proofs from Section [5] 
B.l. Proof of Theorem [4] 

We give two lemmas that together imply Theorem [H 

Lemma 6. Let an instance (A,b,c,k) of Packing IhP(k) be given and let q and r denote 
the column- and row-sparseness of A. In polynomial time we can compute an equivalent in- 
stance (A' ,b' ,c' ,k) such that 

1. A' has dimension at most kq 2 r x kqr, i.e., the new ILP has at most kq 2 r constraints and 
at most kqr variables. 

2. 1 < Q < k — 1, i.e., each increase in a variable contributes a value of at least one and at 
most k — 1. 

3. A' is a submatrix of A, and b' and c' are subvectors of b and c respectively (this means 
also that column- and row-sparseness of A' are at most the same as for A). 
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Proof. Let us begin with a few simple observations: First, we may delete all variables Xi 
with Q = since we have no increase in value c T x for setting X{ to a nonzero value but it may 
make constraints harder to fulfill; it is crucial that all constraints are of the form A[i, -]x < hi. 
Second, we check for variables xi such that setting X{ = 1 already makes some constraint 
infeasible, i.e., the assignment defined by 

fl if J = », 

Xj = < 

10 else, 

is infeasible. We may then delete xi since it cannot contribute to c T x (for feasible x). Finally, 
if Cj > k for any Xi then we may answer yes since we already know that we can set Xi to one 
(all others to zero) which gives c T x > k. Henceforth 1 < c < k — 1. 

Now, let us see how to get the upper bounds of kq 2 r and kqr on the number of constraints 
and variables. We greedily select variables such that no two appear in a shared constraint; 
we end up with a set S of variables. If IS") > k then assigning one to each variable in S is 
feasible (since each constraint "sees" only one variable with value one), and the pay off CiX{ per 
variable xi is at least one, so the value of this is at least k. Thus, if |5| > k then we may answer 
yes. Otherwise, if IS") < k then every variable not in S must be in a shared constraint with 
some variable x% € S. Each such constraint has at most r — 1 other variables beyond Xi and 
each Xi € S appears in at most q constraints. Thus the total number of variables is at most 

\S\ + (r — 1)9|<5| < kqr. 

Since each variable appears in at most q constraints this also limits the number of constraints 
to at most kq 2 r. (If some constraint has bi > but no variables with nonzero coefficients then 
of course the answer is no instead.) Since we only delete variables and constraints from the 
original instance it is clear that, e.g., A' is a submatrix of A. □ 

According to the lemma we can always reduce to at most kq 2 r constraints and at most kqr 
variables. Thus, for Packing ILP(/c + q + r), i.e., parameterized by k, q, and r, this constitutes 
a preprocessing to an equivalent instance with a polynomial number of coefficients. This also 
means that the problem is fixed-parameter tractable by an application of Lenstra's [18J algo- 
rithm for bounded dimension ILPs. When r is constant then this reduction gives a polynomial 
kernelization, as we show next. 

Lemma 7. r-PACKlNG ILP (k + q) admits a polynomial kernelization. 

Proof. To begin, we apply the reduction given by Lemma El With the goal of finding a feasible 
point x with c T x > k it is clear that it suffices to use a maximum range of {0, . . . ,k} for each 
variable Xi (if x is feasible then so are all x' with < x' < x, and a single position of value k 
suffices). Furthermore, as for r-CovER ILP (A;), any constraint on r variables excludes some 
of the |{0, ... ,k}\ r possible assignments. Thus we can follow the same argumentation as in 
the proof of Corollary [3] for encoding the feasible points: For each of the kq 2 r constraints, we 
encode in 0(r \og(kqr)) = 0(\og(kq)) the names of its r variables, followed by a binary array 
of bit size |{0, . . . ,k}\ r = 0(k r ) that encodes for all points in {0, . . . , k} r whether they are 
feasible (according to this constraint). Finally, we add an encoding of the target function, i.e., 
a sequence of kqr values from {0, . . . ,k — 1} which takes 0(kqlog k). This gives a compression 
to bit size 

Oikq 2 r ■ (k r +log(fcg)) + kqlogk) = 0(k r+1 q 2 + kq 2 \og(kq)) = {k + q)°V> . 
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Again, it can be observed that the encoded problem is in NP, as a candidate solution x can be 
efficiently verified (solution value and consistency with all local lists of feasible points). Thus, 
by a Karp reduction back to r-PACKlNG ILP (A; + q) we get a polynomial kernelization, as the 
output size is polynomial in the compression obtained above. □ 

B. 2. Proof of Theorem |5] 

Now, we will address Theorem [SJ The arguments are analogous to those needed for Theorem H] 
so we will focus on the differences. 

Lemma 8. Let an instance (A,b,c,k) of Cover ILP (7c J be given and let q and r denote the 
column- and row-sparseness of A. In polynomial time we can compute either an equivalent 
instance (A',b',c',k) such that 

1. A' has dimension at most kq x kqr, i.e., the new ILP has at most kq constraints and kqr 
variables. 

2. 1 < c < k, i.e., each increase in a variable contributes a cost of at least one and at most k. 

3. A' is a submatrix of A, and b' and c' are subvectors of b and c respectively (this means 
also that column- and row-sparseness of A' is at most the same as for A ). 

Proof. The reduction to c > 1 and b < 1 follows analog to Lemma [H If Cj > k for any variable Xi 
then we cannot afford to set it to one (or larger) and may delete it from the ILP. Now, each 
remaining variable Xi that is set to Xi > 1 costs at least one and each remaining constraint 
requires such a variable. Since we have a total budget of k and each variable appears in at 
most q constraints this limits the number of constraints to at most kq (else we return no). The 
number of variables is at most kqr since each constraint contains at most r variables. Clearly, 
all required operations take only polynomial time and the produced matrix A' and vectors b' 
and c' are submatrices respectively subvectors of A, b, and c. □ 

Thus the Cover ILP (A; + q + r) problem can be reduced to at most kq constraints and at 
most kqr variables, i.e., to a polynomial number of coefficients. Regarding polynomial kernels 
for r-CovER ILP(/c + q) we refer to our more general result for r-CoVER ILP(/c) by Lemma[3] 
and Corollary [3j Nevertheless, if the column-sparseness q is indeed small, it is of course possible 
to obtain a polynomial compression and kernelization via Lemma [5] combined with a direct 
analog of Lemma [7] (the encoding is entirely the same with slightly better bounds due to fewer 
constraints and variables; the interpretation of the encoded problem is of course as a covering 
problem) . 

C. Basic intractable cases for covering and packing I LPs 

In this section we briefly discuss the hardness results that can be observed from various hard 
problems being captured as packing or covering ILP problems. The first two are immediate; we 
spell out the ILP for W[l]-hardness of r-PACKiNG ILP(fc) for completeness, but omit a detailed 
discussion. 

Proposition 1. The r-PACKlNG ILP (7c J problem is hard for W[lJ for all r > 2 by reduction 
from Independent Set (7c,). 
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Proof. Given an instance (G = (V,E),k) for Independent Set(/c), make an indicator vari- 
able x v for each vertex v €V, and create the following ILP: 

max 1 x 
s.t. x u + x v < 1 V{u, v} £ E 
x > 

where 1 denotes the all one vector of dimension \V\. The created Packing ILP(/c) instance 
asks for a feasible point x with value l T x at least k. □ 

Proposition 2. The Cover ILP(k) problem is hard for W[2] by reduction from Hitting 

SETffcJ. 

If we restrict the column-sparseness to any constant q > 4 then both Cover ILP(fc) and 
Packing ILP(/c) remain W[l]-hard. 

Proposition 3. Let q > 4. The g-CovER TLP(k) and the (/-PACKING ILP (k ) problem are 
WflJ-hard. 

Proof. For both problems we give a reduction from Subset SuM(fc) which is W[l]-hard due to 
Downey and Fellows [8] (see also [7]). In Subset Sum(/c) we are given n non-negative integer 
values, say, a±, . . . ,a n , along with a target value t and the question is whether there exists a 
choice of at most k numbers whose sum is exactly t; the problem is parameterized by k. To 
simplify the reduction we ask for a subset with exactly k numbers (this can be ensured by 
adding k times the value for padding any subset matching the target sum to cardinality k). 
We also assume that ai<t for all i since larger values can be discarded. 

We begin with the reduction to q- Packing ILP (A;). We create the following packing ILP 
with variables x\, . . . , x n , asking for a feasible point x with l T x > k. 



x < 1 

n 

^2(t - ai)xi <{k- l)t 
i=l 

Clearly, each variable appears in exactly 4 constraints, apart from x > 0. (If variables are 
defined to be 0/1, or if range constraints xi S {0, 1} are free, then we get q > 3.) Note that 
the following equivalence holds; it uses that feasible solutions with value Yl?=i x i — ^ satisfy 
also Y17=i x i = k - 

n 

y^(t - ai)xi <(k — i)t 

i=l 



max 1 x 



n 

s 

1=1 

n 



.t. Xi < k 



y~] aiXi < t 
i=l 
x > 



t x t ~ aiXi — — i)t 

i=l i=l 
n 

kt — a i x i ^ — 1)* 
i=l 

n 
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Thus feasible solutions with value at least k satisfy also XT=i x i = k and J27=i a i x i = Since 
each Xi is either zero or one this translates directly to a choice of k numbers a» whose sum is 
exactly t. The reverse holds easily, by setting the Xi according to a choice of k numbers with 
sum exactly t. 

For g-CovER ILP(fc) the same style of ILP works, except for a caveat: We do not have the 
ability to enforce x < 1. While this could be replaced by 

> — 1 

for all j € {1, . . . ,n}, this would violate the desired bound on the column-spar seness. If the 
problem setting permits to define a range for each variable Xi then of course we can add Xi £ 
{0, 1} instead; this would give column-sparseness q = 3. Otherwise, using a derandomized form 
of color-coding (e.g., using splitters [20j), we could first show W[l]-hardness for Multicolored 
Subset Sum(/c) where the numbers are colored 1, . . . , k and the task is to match the target sum 
by picking one number of each color (the back reduction to Subset Sum(/c) is easy by encoding 
the colors into the numbers). Then we could augment our reduction by demanding that the 
sum over all indicator variables for numbers of the same color be at least one (giving column- 
sparseness (7 = 4). This naturally enforces that each variable takes value at most one, since we 
have exactly k colors. Nevertheless, since this is not the main focus of this work, we will not go 
into the details of this. For completeness we state the covering ILP with 0/1- variables. 



Xi £ {0,1} 

n 

Y^{t - ai)xi > (k — l)t 

i=l 

Correctness can be argued analogously to the case of q- Packing ILP (A:). □ 

Finally, for Cover IhP(k+r), i.e., parameterized by target solution cost plus row-sparseness, 
it is straightforward to show fixed-parameter tractability but the problem does not admit a 
polynomial kernelization by encompassing Hitting SET(fc + d), parameterized by solution size 
and maximum set size d; for the latter problem Dom et al. [6l Theorem 6] ruled out polynomial 
kernels, assuming NP ^ coNP/poly. 

Proposition 4. The Cover ILP (k + r) problem is fixed-parameter tractable but admits no 
polynomial kernel or compression unless NP C coNP/poly. 

Proof. Let us first sketch an 0(r k {n + m)°^) time fpt-algorithm following similar algorithms 
for, e.g., r-HiTTiNG Set(/c) [7J. We may assume that the cost function c T x has Cj > 1 for 
all i £ {l,...,n} (otherwise such variables x^ can be removed along with their constraints, 
as setting them to arbitrarily large values satisfies all their constraints but costs zero). The 
algorithm starts from x = and tries to find a minimal feasible solution x with c T x < k by 
branching. In each recursive call with some partial solution x, it picks an arbitrary infeasible 
constraint and branches on the choice of one of its at most r variables, and increases the value of 
that variable by one (in x). This increases the cost c T x by at least one. In each branch, this is 



n 

mm > x l 



i=l 
n 

.t. Xi>k 

i=l 
n 

y j aiXi > t 



i=l 
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repeated until either a feasible solution is found, or until we have reached the maximum budget 
of k. It is easy to prove correctness by using the invariant that each level of the branching tree 
contains at least one (possibly infeasible) solution x with x < x* (position- wise), where x* is 
an arbitrary feasible solution with cost c T x at most k (if one exists). The runtime is clearly 
dominated by the 0{r k ) leaves of the search tree (times polynomial factors). 

The reduction from Hitting Set(/c + d) to Cover ILP(fc + r) is straightforward: Make an 
indicator variable for each element of the base set and ask for a solution with sum of indicators 
at most k. For each set, demand that the sum of indicators for its elements is at least 1; this 
gives row-sparseness r = d. □ 



D. A polynomial kernelization for r-ILP Feasibility(n + C) 

Proposition 5. The r -Integer Linear Program Feasibility (n + C) problem admits a 
polynomial kernelization. 

Proof. The key fact is that there can be only few different constraints in an instance of r- 
ILPF(n + C): Any linear constraint on d < r variables, say, 

a\x ix + a 2 x i2 + . . . + adx id < ft 

is characterized by the following aspects. 

1. The choice of relation symbol {<,>,=}. 

2. The choice of d variables: (™) = 0{n d ). 

3. The values of or, . . . , an d /3. By problem restriction, the absolute values of the coeffi- 
cients are at most C, hence for each one we have 2C + 1 choices out of {— C, . . . , 0, . . . , C}. 
Thus, there are (2C + l) d+1 choices. 

Over all d < r this gives a total number of m = 0(n r C r+1 ) different constraints. Thus, 
discarding all duplicate constraints (same variables and coefficients) and using the canonical 
encoding in 0(nm log C) = 0(n r+1 C r+1 log C) bits constitutes a polynomial kernelization with 
respect to n + C. □ 



E. A polynomial kernelization for q-\LP Feasibility(m + C) 

Proposition 6. The ^-Integer Linear Program Feasibility (m + C) problem admits a 
polynomial kernelization. 

Proof. The main observation is that there is only a bounded number of different coefficient 
patterns for the variables since each variable appears only in a constant number of constraints: 

1. Each variable appears in some set of up to q constraints, giving less than (m+l) q = 0(m q ) 
choices. 

2. In each such constraint, it has a coefficient of absolute value at most C, giving (2C + l) g 
choices. 
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Now, if two variables, say Xi and Xj have the same pattern, then we delete one of them. Let 
us check that deletion of any single variable Xj for which we have another variable with the 
same pattern does no harm: It is clear that a solution for the created ILP can be extended to 
the original by setting Xj = 0. Conversely, let Xi = Oj and Xj = aj be part of a feasible point 
for the original ILP. For the created ILP, which does not contain Xj, we create a feasible point 
by letting x« = a% + cij, and letting all other coordinates be the same. It is easy to see that 
due to having the same pattern, all constraints are still satisfied (this is true for any way of 
splitting cij + a,j between Xi and Xj ) . 

In total we keep only one variable for each pattern, giving 0(m q C q ) variables. This allows 
an encoding in space 0(nm log C) = 0(m q+1 C q log C). 

A technical note: We have assumed that there is no global constraint x > that we would 
not count as separate constraints X{ > in the parameter m (by nature of writing ILPs as, 
e.g., Ax < b, x > and taking m to be the number of rows of A). If we have x > then, in the 
above argument, letting Xi = ai + dj still works (since ai,aj > 0) and, similarly, setting Xj = 
is feasible. □ 
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Figure 1: Existence of polynomial kernels for (unrestricted) Integer Linear Program Fea- 
sibility regarding parameterization by a subset of n, m, logC, and C (depicted in 
top half of nodes). "PK" indicates a polynomial kernel, "no PK" means that no 
polynomial kernel is possible unless NP C coNP/poly, and "NP-hard" stands for NP- 
hardness with constant parameter value. Positive results transfer upwards to easier 
(larger) parameterizations; negative results transfer downwards to harder (smaller) 
parameterizations. NP-hardness of ILPF(C) for C = 1 comes from Satisfiability. 
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Figure 2: Existence of polynomial kernels for (r-row-sparse) t-Integer Linear Program 
Feasibility regarding parameterization by a subset of n, m, logC, and C (depicted 
in top half of nodes). "PK" indicates a polynomial kernel, "no PK" means that 
no polynomial kernel is possible unless NP C coNP/poly, and "NP-hard" stands 
for NP-hardness with constant parameter value. Positive results transfer upwards 
to easier (larger) parameterizations; negative results transfer downwards to harder 
(smaller) parameterizations. NP-hardness of r-ILPF(C) for C = 1 comes from 3- 
Satisfiability. 
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Figure 3: Existence of polynomial kernels for (g-column-sparse) (/-Integer Linear Program 
Feasibility regarding parameterization by a subset of n, m, logC, and C (depicted 
in top half of nodes). "PK" indicates a polynomial kernel, "no PK" means that no 
polynomial kernel is possible unless NP C coNP/poly, and "NP-hard" stands for NP- 
hardness with constant parameter value. Positive results transfer upwards to easier 
(larger) parameterizations; negative results transfer downwards to harder (smaller) 
parameterizations. The lower bound for q-ILPF(m + logC) follows from a lower 
bound for Small Subset SuM(/c + <i) due to Dom et al. [6j: Expressing Subset Sum 
with weights of value at most 2 d requires only m = 0{1) constraints and the encoding 
size logC per weight of value at most C = 2 d is 0(d). NP-hardness of g-ILPF(C) 
for C = 1 comes from Satisfiability restricted to instances with variable degree at 
most three (cf. [12]). 
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